Modeling code-Switching speech on under-resourced languages for language identification
نویسندگان
چکیده
This paper presents an integration of phonotactic information to perform language identification (LID) in a mixed-language speech. A single-pass front-end recognition system is employed to convert the spoken utterances into a statistical occurrence of phone sequences. To process such phone sequences, a hidden Markov model (HMM) is utilized to build robust acoustic models that can handle multiple languages within an utterance. A supervised Support Vector Machine (SVM) learns the language transition of the phonotactic information given the recognized phone sequences. The back-end SVM-based decision classifies language identity given the likelihood scores phone occurrences. The experiments are conducted on commonly mixed-language Northern Sotho and English speech utterances. We evaluate the system measuring the performance of the phone recognition and LID portions separately. We obtained a phone error rate of 15.7% when a data-driven phoneme mapping approach is modeled with 16 Gaussian mixtures per state. However, the proposed integrated LID system has achieved a considerable performance with an acceptable LID accuracy of 85.0% and average of 81% on code-switched speech and monolingual speech segments respectively. Index Terms Code-switching speech, under-resourced languages, phonotactic information, acoustic models, language model
منابع مشابه
Language identification of code Switching sentences and multilingual sentences of under-resourced languages by using multi structural word information
Language identification (LID) is a process to identify the languages used in a text or speech. Code switching is the switching of a language in a sentence or speech utterance. This paper focuses on LID of words in code switching sentences. Code switching can occur intersentential or intrasentential. The reasons why a writer switches from one language to another due to various reasons and among ...
متن کاملCode-Switching speech recognition for closely related languages
This work presents an approach to recognition of multispeaker conversational speech with code-switching between Ukrainian and Russian languages. Both inter-sentential and intra-sentential code-switching is handled. The approach takes into account peculiarities of phonetic systems of the closely related Russian and Ukrainian languages. A crosslingual LVCSR system is developed. The acoustic model...
متن کاملLanguage Identification for Under-Resourced Languages in the Basque Context
Automatic Speech Recognition (ASR) is a broad research area that absorbs many efforts from the research community. The interest on Multilingual Systems arouses in the Basque Country because there are three official languages (Basque, Spanish, and French), and there is much linguistic interaction among them, even if Basque has very different roots than the other two languages. The development of...
متن کاملEfficient Acoustic Modeling Method for Unsupervised Speech Recognition using Multi-Task Deep Neural Network
This paper proposes a method of acoustic modeling for zero-resourced languages speech recognition under mismatch conditions. In those languages, very limited or no transcribed speech is available for traditional monolingual speech recognition. Conventional methods such as IPA based universal acoustic modeling has been proved to be effective under matched acoustic conditions (similar speaking st...
متن کاملIntroduction to the special issue on processing under-resourced languages
The creation of language and acoustic resources, for any given spoken language, is typically a costly task. For example, a large amount of time and money is required to properly create annotated speech corpora for automatic speech recognition (ASR), domain-specific text corpora for language modeling (LM), etc. The development of speech technologies (ASR, Text-to-Speech) for the already highreso...
متن کامل